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O ! Abstract 

Gowers [Gow98, GowOl] introduced, for d > 1, the notion of dimension-d uniformity U d (f) 
of a function / : G — > C, where G is a finite abelian group. Roughly speaking, if a function has 
small Gowers uniformity of dimension d, then it "looks random" on certain structured subsets 
' of the inputs. 

• We prove the following "inverse theorem." Write G = G\ X ■ • ■ X G„ as a product of 

groups. If a bounded balanced function / : G\ x ■ • ■ G„ — > C is such that U d (f) > e, then 
one of the coordinates of / has influence at least e/2°^ d \ Other inverse theorems are known 
[Gow98, GowOl, GT05, Sam05], and U 3 is especially well understood, but the properties of 
functions / with large U d (f), d > 4, are not yet well characterized. 

The dimension-d Gowers inner product ({fs})u d °f a collection {/s}sc[ti] of functions is a 
related measure of pseudorandomncss. The definition is such that if all the functions fs are 
' equal to the same fixed function /, then ({fs})u d = U d (f). 

We prove that if fs ■ G\ X • • • X G n — ► C is a collection of bounded functions such that 
l({/s})t/ d l ^ e an d a t least one of the fs is balanced, then there is a variable that has influence 
at least e 2 /2°( d ) for at least four functions in the collection. 

Finally, we relate the acceptance probability of the "hypergraph long-code test" proposed 
I/"") . by Samorodnitsky and Trevisan to the Gowers inner product of the functions being tested and 

we deduce the following result: if the Unique Games Conjecture is true, then for every q > 3 
there is a PCP characterization of NP where the verifier makes q queries, has almost perfect 
completeness, and soundness at most 2q/2 q . For infinitely many q, the soundness is (q + l)/2 q . 

S Two applications of this results are that, assuming that the unique games conjecture is true, 
it is hard to approximate Max fcCSP within a factor 2k /2 k (or even (k + l)/2 k , for infinitely 
J> . many k), and it is hard to approximate Independent Set in graphs of degree D within a factor 

1 Introduction 



We return to the study of the relation between number of queries and error probability in proba- 
bilistically checkable proofs. 
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The PCP Theorem [AS98, ALM+98] states that it is possible to encode certificates of satisfiability 
for SAT instances (and, more generally, for every problem in NP) in such a way that a probabilistic 
verifier can check the validity of the certificate with high confidence after inspecting only a constant 
number of bits. We write SAT 6 PCP CjS [r(ra), q] if there is a verifier that uses at most r(n) random 
bits, where n is the size of the formula, accepts encoding of valid proofs with probability at least 
c (the completeness probability of the verifier) and accepts purported encodings of proofs with 
probability at most s (the soundness error of the verifier) if the formula is unsatisfiable. The PCP 
Theorem states that there exists a constant k such that SAT £ PCPr ,1/2 [O (log n), k]. Improvements 
and variants of the PCP Theorem and their applications to the study of the approximability of 
optimization problems are too many to summarize here, and we refer the reader to the chapter on 
hardness of approximation in Vazirani's book [VazOl] and to some recent survey papers [Aro02, 
Fei02, Tre04]. 

In this paper we are interested in the following question: for a given number of queries, what is 
the highest confidence that we can have in the validity of the proof? That is, for a given value of 
q, what is the smallest value s = s(q) for which SAT E PCPi_,s iS+( s[0(logn), q] for every 5 > 0? 
We call this parameter s the soundness of the PCP construction. A good measure of the trade-off 
between the number q of queries and the soundness s is the so-called amortized query complexity, 
defined as q = qj (log 2 s" 1 ). 

A simple argument shows that, unless P = NP, s must be at least 1/2 9 , that is, the amortized 
query complexity must be at least 1. A more careful argument gives a lower bound oi2/2 q [Tre98a] 
on the soundness, which was recently improved to • ^q) by Hast [Has05]. (Hast's result can 

also be stated as giving a lower bound of 1 + (1 — o(l))^p to the amortized query complexity of 

a g-query PCP.) The PCP Theorem shows that we can have s = l/2^ q \ and the authors showed 
that can have s < 2 2 ^ q /2 q [ST00]. (That is, the amortized query complexity can be as low as 
1 + 0(1/ y/q).) Our proof was simplified by Hastad and Wigderson [HW03], and the soundness was 
improved to s < 2^ 2 ~ q /2 q by Engebretsen and Holmerin [EH05]. As we discuss below, 2 e ^) /2 q 
was a natural limit for the soundness achievable with current techniques. 

In this paper, assuming Khot's Unique Games Conjecture [Kho02], we present an improvement to 
s = {q + l)/2 9 . Our analysis is based on a theorem, which is probably of independent interest, 
bounding the Gowers uniformity of a given function in terms of the influence of its variables. 

1.1 Linearity Tests and PCP 

The linearity testing problem captures most of the technical difficulties of the the construction of 
PCP constructions, and it is a good starting point. 

Let us call a function / : {0, l} n — > { — 1,1} linear if it is a homomorphism between the group {0, l} n 
(where the operation is bitwise XOR, written additively) and the group {—1, 1} (where the operation 
is multiplication). Equivalently, / is linear if and only if it can be written as f{x\,... ,x n ) = 
(_l)E lGS ^ f or S ome set S C [n]. We use the notation xs( x ) '■= (—l)^ iesXi ■ 

In the linearity testing problem we are given oracle access to a boolean function / : {0, l} n — > 
{ — 1, 1} and we would like to distinguish between the following extreme settings: 

1. / is linear; 
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2. for every S, the agreement between / and xs is & t most 1/2 + e. 

By agreement between a function / and a function 5 we mean the fraction of inputs on which they 
are equal. We say that a test has error-probability at most e if in case (1) it accepts with probability 
1 and in case (2) it accepts with probability at most e + s', where e' — ► when e — > 0. 

Blum, Luby and Rubinfeld [BLR93] define a very simple such test, that makes only three queries 
into /: 

BLR- Test (f) 

choose x, y uniformly at random in {0, 1}" 
accept if and only if f(x) ■ f(y) = f(x + y) 



Bellare et al. [BCH + 96] give a tight analysis of this test, showing that if it accepts with probability 
at least 1/2 + e, then / has agreement at least 1/2 + e with some linear function. According to our 
above definition, the BLR test has error probability at most 1/2. 

There are at least two ways in which such a result needs to be extended before it can be used in a 
PCP construction. 

First of all, we would like to consider a case where two or more functions are given as an oracle, 
and the test wants to distinguish between the following cases: 

1. The functions given as an oracle are all equal to the same linear function 

2. No two functions have agreement more than 1/2 + e (or less than 1/2 — e) with the same 
linear function 

There is a natural extension of the BLR test to this setting: 

3-functions-BLR-Test (/, g, h) 

choose x, y uniformly at random in {0, 1}™ 
accept if and only if f(x) ■ g(y) = h(x + y) 

Aumann et al. [AHRS01] show that if this test accepts with probability 1/2 + e, then there is a 
linear function xs such that f,g, h have all agreement at least 1/2 + e/3 with either xs ° r — Xs- 

The second change is that, for the sake of PCP constructions, we are especially interested in linear 
functions xs with a small S. We call functions of the form X{i}( x ) = (~^) Xi long codes. In a long 
code test we are given several functions and, in a commonly used definition, we want to distinguish 
the following cases: 

1. The functions given as an oracle are all equal to the same long code; 

2. For every small S, no two functions have agreement more than 1/2 + e (or less than 1/2 — e) 
with xs- 

We say that such a test has error probability at most e if, whenever it accepts with probability 
more than e + e, there are constants e',d, depending only on e, and a set S, \S\ < d, such that at 
least two of the given functions have agreement at least 1/2 + e' with xs or — Xs- 
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A test satisfying this definition can be obtained from the BLR test by adding noise to each query. 
Let /j,s be the probability distribution over {0, l} n defined by picking n independent copies of a 
biased coin that returns with probability 1 — 5 and 1 with probability 5, and consider the following 
test: 

<5-noisy-3-functions-BLR-Test (/, g, h) 

choose x, y uniformly at random in {0, 1}" 

sample 771 , 772 , 7/3 indipendently according to fi$ 

accept if and only if f{x + 771) • g(y + 7/2) = h(x + y + 773) 



It is easy to see that in case (1), that is, when f,g,h are equal to the same long code, then the 
test accepts with probability at least (1 — <5) 3 . Hastad [HasOl] shows 1 that if the test accepts with 
probability at least 1/2 + e, then there is a set S of size at most poly(e , such that f,g,h all 
have agreement at least 1/2 + e/3 with \s or — xs- 

Having a q-query test for this problem with error probability at most e is enough to construct 
a PCP characterization of NP with query complexity q and soundness error about e, provided 
that Khot's unique games conjecture [Kho02] is true. (Further refinements are needed to derive an 
unconditional result in [HasOl].) 

Given this logical path from the basic linearity testing problem to the task of constructing PCPs, 
our plan in [Tre98b, ST98, ST00] was to devise linearity tests with good trade-offs between number 
of queries and error probability, and then "translate" such tests into a PCP construction. 

In [ST00] we devise a linearity tester whose asymptotic trade-off between number of queries and 
error probability is optimal. The test, for k > 2 is defined as follows: 

Complete-Graph- Test (/, k) 

choose xi, . . . , uniformly at random in {0, l} n 
accept if and only if 

for every % / j, f(xi) • f(xj) = f(xt + Xj) 

Note that the test has query complexity q = k+ (2) and runs (2) correlated copies of the BLR test. 
If / is linear, then it is clearly accepted with probability 1. If / has agreement at most 1/2 + e, then 
we already know that each of the (Jj) tests accepts with probability at most 1/2 + e. In [ST00] we 
show that the (2) tests behave almost mutually independently, and the probability that all accept 

is at most 1/2 + e' w l/2 q ~^. We also extended the test to the noisy case, the case of several 
functions, and the setting (which we do not describe in this paper) which is sufficient to derive an 
unconditional PCP characterization of NP. 2 

One might have thought that the following test would have achieved an even better trade-off 
between number of queries and error probability: 

1 Such an analysis is implicit in Hastad's paper, and the result, as stated here, appears explicitely in [AHRS01]. 

2 In this last step, we lost something in the soundness error, which became l/2 9-2 ^, where q is the number of 
queries. This was recently improved to l/2 9-v/ ^, the same bound of the basic linearity test, by Engebretsen and 
Holmerin [EH05]. 
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Complete-Hypergraph-Test (/, k) 

choose xi, . . . ,Xk uniformly at random in {0, l} n 
accept if and only if 

for every S C [fc]: \S\ > 2. UjeS = fCEjesV) 

In the hypergraph test we make q = 2 k — 1 queries and run 2 k — k — 1 correlated copies of the BLR 
test. If the tests behaved almost mutually independently for / far from linear, then the test would 
have error probability ~ (q + l)/2 q . 

Unfortunately, the above statement fails already when k = 3 if we consider the function f(x) = 
^_-±yiX2^ hx n -ix n ^ g uc h a function is very far from linear, but the 7 conditions of the hypergraph 
test for k = 3 are not almost independent. In fact, much more generally, we prove in [ST00] 
that if we run a g-query hypergraph test on /, then the test accepts with probability at least 
2 - g +n(V«). in Section 7 we generalize this result and prove that any linearity test that makes q 
queries and that accepts linear functions with probability > c must accept / with probability at 
least (1 - c) + 2-«+ Q (v^) . 

Even though there is no more room for improvement in the basic linearity testing problem (or its 
multi- function version, which is only harder), there is still hope for PCP, by using a more relaxed 
formulation of the long code test. Several recent papers [DS05, Kho02, KR03, KKMO04, CKK+05, 
KV05] define a long code test based not on closeness to linear functions xs with small S but based 
on the notion of degree-d influence. We will define such a notion later (Section 2). For now, it 
suffices to say that if we can solve a certain (relaxed) variant of the linearity test with a given 
query complexity and error probability, then we can also test the long code with the same query 
complexity and error probability. We say that the relaxed test has error probability e if: 

1. if / is linear, then the test accepts with probability 1; 

2. if the test accepts with probability e + e, then there is a coordinate i that has influence at 
least 8(e) for /. 

Influence (as opposed to "degree-d influence") has a simple definition for boolean functions: the 
influence of i for / is the probability that / is a non-constant function after we randomly fix all 
coordinates of / except the i-th. If / has agreement 1/2 + e with a non-constant linear function 
Xs, then / has variables of influence at least 2e, but there are functions / that have influential 
variables even though they are very far from all linear functions (this is why this test is a relaxation) . 
Intriguingly, in the function f(x) = (— l) 11 ^ 2 " 1 \-x n -ix n a rj variables have influence 1/2 ± o(l). 

As we shall see later, we prove that the complete hypergraph test has error probability only 
1/ ^2 2fc ~ fc_1 ^ with respect to the above definition. That is, if a function / is accepted by the 

test with probability more than 1/ (2 2k - k ~A + e, then one of the variables of / has influence at 

least (5(e). We then generalize the test to the setting of several function (in which case we show 
that at least two functions have an influential variable in common) and to the case of testing the 
long code, resulting in a conditional PCP construction assuming the unique games conjecture. 



5 



1.2 Szemeredi's Theorem and Gowers Norms 

We will use a definition that has been introduced by Gowers [Gow98, GowOl] in his seminal papers 
on a new proof of Szemeredi's Theorem. 

Szemeredi's Theorem states that any subset of the integers of positive density contains arbitrarily 
long arithmetic progressions. The quantitative statement is that for every 5, k there is a n(6, k) 
such that if A is an arbitrary subset of the integers {1, . . . , N}, \A\ > 5N, and N > n(5, k), then 
A must contain an arithmetic progression of length k, that is, there are integers a, b such that 
a, a + 6, . . . , a + {k — 1)6 all belong to A. 

In Szemeredi's original proof, which introduced the famous Szemeredi Regularity Lemma, n(5, k) 
grows very fast with k and <5 : it is a tower of exponentials whose height is a tower of exponentials 
whose height is polynomial in k and 5 _1 . 

The case k = 3 had been settled earlier by Roth, with a simpler analytical proof that gave a 
reasonable (doubly exponential) relation between n(5, 3) and 6" 1 . Roth's proof (like all other 
proofs about arithmetic sequences in dense sets) is based on an iterative argument on 5: if 5 is a 
sufficiently large constant (say, 90%), then a random length-3 progression is contained in A with 
positive probability, and so A contains some length-3 progressions. Otherwise (blurring, for the 
moment, the distinction between progressions mod N and true progressions), one writes the fraction 
of length-3 progressions in A as the number one would expect if A where a random set of density 
5, that is, a 5 3 fraction of all length-3 progressions, minus an error term that equals X/g^o 
the summation of the cubes of the non-zero Fourier coefficients of /a, the characteristic function 
of A. One then considers two cases: if all Fourier coefficients of /a are small, then the error term 
is smaller than S 3 , and so A contains a positive fraction of all length-3 progressions in Z^v- If 
j 'a has a large Fourier coefficient, then one can reduce the task of finding a length-3 progression 
in A to the task of finding a length-3 progression in a certain subset A 1 of {1, ... ,N'} of density 
5' > 5 + f2(<5 2 ). In this reduction, N' is about y/~N, one does no more than 0(5~ 1 ) such reductions, 
so it's enough to start with N = 2 2 ° (1 4 . We mention these technical details because they are 
very similar to the analysis of the basic BLR linearity test in [BCH + 96], where the acceptance 
probability of the verifier is written as \ + \ J2 a y{a), and so (i) if all Fourier coefficients of / are 
small, then ^ / 3 (a0 is small, and the test accepts with probability close to 1/2, while (ii) if one 
Fourier coefficient is large, then / is close to a linear function (this part is trivial). 

A way to look at both proofs is to think of a function / as being "pseudorandom" if all its 
Fourier coefficients are small, and of a set A C [N] to be pseudorandom if all the non-zero Fourier 
coefficients of its characteristic function are small. Then one proves that, for a pseudorandom 
function, the values f(x), f(y), f(x + y) are nearly independent random bits, and so, in particular, 
f(x)f(y) = f(x + y) happens with probability approximately 1/2. For a psedorandom set of density 
S, the events x S A, (x + r) G A, {x + 2r) E A also behave nearly independently for random x, r, 
and they happen simultaneously with probability approximately <5 3 . 

To prove Szemeredi's Theorem for k > 4 one might try to show that a pseudorandom set, as 
defined above, has approximately the expected number of length-/c progression. This, unfortunately, 
does not seem to be true, and sets whose characteristic function is defined in terms of a degree- 
2 polynomial are basic counterexamples even for k = 4. (See [GowOl].) Recall that something 
similar happens in the hypergraph test, where a function defined in terms of a degree-2 polynomial 
is very far from linear (and so all its Fourier coefficients are small), but the tests performed in the 
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hypergraph test do not behave independently. 

Gowers [Gow98, GowOl] resolves this problem by introducing a more refined measure of pseudo- 
randomness. For d > 1 and / : G — > R, where G is a group, he defines the quantity 

n f(*+E x *) 

SC[d] \ i&S / 

and there is a similar definition for / : G — > C, in which all evaluations of /() when \S\ is odd are 
conjugated. (See definitions in Sections 2 and 6.) 

Gowers goes on to prove that if A C 7Ln is a subset of density 5, f : Zn — ► [— 1, 1] is a normalized 
version of the characteristic function, and U d {f) is sufficiently small (as a function of d and S, 
but not of N) then A must contain arithmetic progressions of length d + 1. The foard part of 
Gowers's work is then to show that if U d (f) is large, then finding progressions in A reduces to 
finding progressions in a set A' C Z^r' of density strictly larger than 5. 

Towards this goal, Gowers proves certain structural properties of functions / : Z^r — ► C, N prime, 
having non-trivially large U d value. Such functions are shown to have a certain "local correlation" 
with degree (d — l)-polynomials. Green and Tao [GT05] study functions / : G — > C with large II s , 
and, provided that the order of G is not divided by and 2 and 3, prove a certain "global correlation" 
between such functions and degree-2 polynomials. 3 Samorodnitsky [Sam05] proves such a result 
for functions / : {0, l} n -> R. 

Not much is known about functions / : G — > C having large U d when d > 4 and G is a general 
group. 

1.3 Our Results 

We prove that if / : G\ x • • • X G n — > C is a bounded balanced function and U d (f) > e, then there is 
a variable that has influence at least e/2 0<yd ^ for /. Above, we only defined influence in the boolean 
case, but a more general definition applies to functions mapping from an arbitrary product space 
into the complex numbers. 

Green and Tao introduce a notion related to the U d measures of pseudorandomness. For 2 d functions 
{fs}sc[d], fs '■ G — > R, their Gowers inner product is defined as 

n /*u+2>) 

sc[d] V ies J 

In the case of complex- valued functions, the terms with odd |5| are conjugated. Note that if all 
the functions fs are identical to the same function / then ({fs})u d = U d (f). 
Our second result is that if / : G\ X • • • X G n — ► C are bounded functions such that at least one 
of them is balanced, and {{fs})u d — e > then there is a variable that has influence at least e 2 /2°( d > 
for at least four of the functions in the collection. 

3 The "globality" of this result deteriorates with the order of the group, and a result as stated is false for, say, Zn, 
N prime. 



U d (f) := E 

X,Xl,...,Xd 



({fs})u<t ■■= E 
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Finally, we come back to the analysis of the hypergraph test. Hastad and Wigderson [HW03] 
significantly simplified the analysis of the graph test of [STOO] by using an averaging argument that 
reduces the analysis of the graph test to the analysis of the 3- functions BLR test. 

We apply a similar averaging argument and reduce the analysis of the hypergraph test to the task 
of bounding expressions of the form 

d sc[d\ \ i J 

where d < k and the functions fs ■ {0, l} n — > {—1, 1} are derived in a certain way from the function 
being tested. The expression looks very similar to the expression for the Gowers inner product, 
and in fact it is easy to see that if the Expression in (1) is large, then the Gowers inner product of 
a related set of functions is also large. From our results, it follows that at least four of these new 
functions must share an influential variable, from which it follows that two of fs must share an 
influential variable and so the function being tested has an influential variable. (See Section 7.) 

This analysis easily extends to the case of testing multiple functions and to the noisy case. We 
present the analysis of the noisy, multi- function case in Section 8. 

This leads, under the unique games conjecture, to a PCP characterization of NP with q queries 
and error probability (q + l)/2 g , if q is of the form 2 k — 1. As corollary, we obtain (q + l)/2 9 
hardness of approximation for qCSP and (poly log D)/D hardness of approximation for independent 
set in graphs of maximum degree D. In terms of amortized query complexity, we show that 

1 + (1 + o(l))^p is achievable assuming the unique games conjecture, and Hast's algorithm [Has05] 
implies that this is tight up to the lower order term. 

1.4 Organization of the paper 

We develop the basic theory of influence of variables and Gowers uniformity for the case G = 7Li 
in Section 2. We prove our connection between influence of variables and Gowers uniformity in 
the case G = ZJ? in Sections 3, 4, 5. Section 6 shows how to generalize our results to the case of 
general abelian groups; only the result of Section 3 requires a new proof, while the other results 
follow by making syntactic changes in the proof for the boolean case. An application to generalized 
linearity testing is presented in Section 7, together with lower bounds establishing the tightness of 
our analysis. An application to PCP is presented in Section 8. 

The paper ends up being quite long because, for the sake of readability, we first prove some results 
in important special cases before proceeding to the more general statements. A reader interested 
only in the PCP application may skip Section 6. A reader interested only in our results on Gowers 
uniformity may skip Sections 7 and 8. 

2 Preliminaries 

In this section we develop the theory of Gowers uniformity and Fourier transforms for functions 
/ : Z2 — > M. Generalized definitions for the setting of functions / : G\ x • • • x G n — > C will be given 
later in Section 6. 
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For a positive integer n, we use [n] to denote the set {1,2, ... ,n}. We use the convention that [0] 
is the empty set. 

For two reals a < b we use [a, b] to denote the interval {x € R : a < x < b}. 
2.1 Fourier Analysis 

For a subset S C [n], define the function xs '■ {0, l} n -* R as 

Xs(xi,...,x n ) = (-l)£ i6S *i 

We use the convention that an empty sum equals zero, so x$( x ) = 1 f° r every x. 
For two functions /, g : {0, l} n — > R, define their inner product as 

{f, 9) ■= E f{x)g{x) 
xe{o,i} n 

Then it is easy to see that the functions xs are orthonormal with respect to this inner product, 
that is, for every S, 

(XS,XS) = 1 

and, for every S / T, 

(xs,Xt) = 

This implies that the functions xs are linearly independent and, since there are 2 n such functions 
and the space of functions / : {0, l} n — > R has dimension 2 n , it follows that the xs are a basis for 
such space. 

Every function / : {0, l} n — > R can therefore be written as 



f{x) = f(S)xs(z 



SC[n] 

where the coefficients f(S) in the linear combination satisfy 

f(S) = (f,xs) 

and are called the Fourier coefficients of /. The function /() mapping sets S into coefficients f(S) 
is the Fourier transform of /. We will make use of the following equality, called Parseval's identity 
(or Plancherel's identity). 



Y J P(S)=Ef\x) (2) 

S 

In particular, if / : {0, l} n -» [-1, 1], then J2s f 2 ( S ) ^ h and so \ f( S )\ < 1 for ever y S - 



9 



2.2 Influence of Variables 

If / : {0, l} n — > { — 1, 1} is a boolean function, then the influence of i for / is defined as 

lnU(f):=Pr[f(x)^f(x + ei )] (3) 

X 

where e% £ {0, l} n is the vector that has a 1 in the i-ih position and zeroes everywhere else. That 
is, Infj(/) is the probability that, if we pick a random x, the value of / at x is different from the 
value of / at the point that we get by flipping the i-th bit of /. 

It is easy that see that Infj(/) satisfies the following identities. 

InU(f) = \ £(/(*) - f(x + e,)) 2 = £ / 2 (S) (4) 

S:i£S 

For a general real- valued function /:{0,l} n — ► R, we will define the influence of i for / as 

Inf,(/) := - E (/(x) - f(x + e,)) 2 = £ P(S) 

S:i&S 

Note that Equation 3 is not valid if / is a general real-valued function. 

We will make use of the following simple fact, that is valid for every function / : {0, l} n — ► R: if 
S 7^ and i £ S, then 

Infi(/) > f 2 (S) 
and, in particular, for every / : {0, l} n — ► R, 

maxlnfj(/) > max/ 2 (S) (5) 

For boolean functions / : {0, l} n — > { — 1, 1}, the stronger inequality maxjlnfj(/) > max 5 _^0 
holds, but we will not use it. 

For a function / : {0, l} n — ► R, a threshold d < n, and an index i £ [n], we define the degree-d 
influence of i for / as 

Inff (/) := £ / 2 (S) . 

5:iG5,|S'|<d 

We will make use of the following simple fact: if / is a bounded function, then not too many 
variables can have large low-degree influence. Specifically: 

n n 

5>ff(/) = ]T £ P(s)= Yl \s\P(s)<dJ2P(s) 

i=l i=l S:ieS,\S\<d S:ieS,\S\<d S 

In particular, if / : {0,1}" — ► [—1,1], then Y2t=i I nr V*(/) — ^> an d so a ^ mos t <^/£ variables can 
have degree- d influence larger than e. 
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2.3 Cross-Influence 



For the application to PCP, the following definition will be useful. Let T = fx, . . . , fk be a collection 
of (not necessarily distinct) functions /,■ : {0, 1}" — ► M. Then the cross-influence of i for T is defined 

as 

XInfj(/i ... ,f k ) := maxm.m{Infi(fj),Iafi(f h )} (6) 

In other words, XInfj(/i, . . . , fk) > e if and only if there are at least two functions fj,fh, with 
j 7^ h, such that Inii(fj) > e and Inf^/zJ > e. Conversely, XInfj(/i, . . . , fk) < e if there is at most 
one function such that Infj(/j) > e. 

More generally, for a threshold t, we define the Across influence of i for T as 

iXInfi(/i, . . . ,/fc) := max min{Inf , (f h ),..., lnU(f jt )} (7) 

iivij'tS[fc], all distinct 

That is, iXInfj^) is the smallest s such that there are at least t functions fj 1 ,..., fj t in T such 
that coordinate i has influence at least e for all of them. Note that XInf(^") = 2XInf(J r ). 

If all the functions fj are equal to the same function /, then fcXInfj(/i, . . . , fk) = XInfj(/i, . . . , fk) = 
Infi(/). 

Similarly, we define the degree-d cross-influence of T as 

XInfP (A . . . , f k ) := maxmin{InfP (/_,-), Inff d (f h )} (8) 



2.4 Gowers Uniformity 

Definition 1 (Gowers Uniformity) Let f : {0, 1}" -»l k o function, and d > 1 be an integer. 
The dimension-<i uniformity o/ / is defined as 



u d {f) 



E 

IE, 21 



n/ 

5C[d] 



i£5 



Remark 1 we use a terminology and notation that is a hybrid between the one of Gowers 

[Gow98, GowOl] and the one of Green and Tao [GT04, GT05]. What we call dimension-d unifor- 
mity is called degree-(d — 1) uniformity by Gowers, and no notation is introduced for it. Gowers 

also introduces the notation which equals, in our notation, (U d (f)) 1 ^ 2 . Gowers proves that 

|| • \\d is a norm, and he does not give it a name. Green and Tao use the notation \\f\\jjd for 

{U d (f)) 1 ^ 2 , and call it the (dimension-d) Gowers norm. 



Here are expressions for the first few values of d: 
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u\f) 
u 2 (f) 

u 3 (f) 



E f(x)f(x + y) = E/(x) 



x,y 

E f(x)f(x + y)f(x + z)f(x + y + z) 

x,y,z 

E[E f(x)f(x + y) 
y \x 

E f(x)f(x + y)f(x + z)f(x + y + z)f(x + w)f(x + y + w)f(x + z + io)/(a; + y + z + io) 
E ( Ef(x)f(x + y)/(x + z)f{x + y + z)" 

y,z \x 



The above examples suggest the use of the following notation. 

For a function / : {0, l} n — > E and elements xi, {0, l} n , define 



SC[<f] V ieS / 



Then we have 



c/ d (/)= e ./:,, ,,(•<•; 



Define the dimension-d Gowers inner product of a collection {/sjscfdl °f ( n °t necessarily different) 
functions f s : {0, l} n -► R, as 



<|{/s}>^ := E 



sc[d] \ ies / 



(9) 



Note that, in particular, if all the functions fs are equal to the same function / then {{fs})u d 

u d {f). 



2.5 Unique Games 

A unique game [Kho02] is a constraint satisfaction problem such that every constraint is of the 
form y = f x ,y(x), where x,y are variables ranging over a finite set E, which we call the alphabet, 
specified as part of the input, and f XiV : E — ► E is a permutation. Given a unique game, we are 
interested in finding the assignment of values to the variables that satisfies the largest number of 
constraints. 

More formally, a unique game is a tuple (V, E, E, {f x ,y}( x ,y)eE) where V is a set of variables, E is 
a set of pairs of variables (corresponding to constraints), and, for every (x,y) € E, the function 
f X)V : E — > E is a permutation. Note that (V, E 1 ) is a graph, which we call the constraint graph 
of the unique game. We want to find an assignment A : V — > E that maximizes the number of 
satisfied constraints, that is, the number of pairs (x,y) S E such that A{y) = f x y (A(x)). The value 
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of an assignment is the fraction of constraints satisfied by the assignment; the value of a unique 
game is the value of an optimum assignment. 

For example, the following is a unique game with V = {v\, v 2 ,V3, v{\ and £ = {a, b, c}: 



" 3 = (acfe) M 
Vl = v 2 

V4 = b ) (V 2 ) 

\b c a J 

Where we use the notation | ,f . „!',. . ] to represent a function f : {a,b,c\ — > {a,b,c\. 

V /(a) f( b ) /(c) / 
The reader can verify that the value of the above unique game is 3/4. 

The unique games conjecture is that for every 7 > there is a a = 17(7) such that it is NP-hard to 
distinguish unique games of value > 1 — 7 from unique games of value < 7, even when restricted 
to instances where |S| < a and where the constraint graph is bipartite. 

For our application, we will need a variant of unique games, that we call (f-ary unique game. 
In a (f-ary unique game, a constraint is specified by a (f-tuple v\ . . . , of variables and a d- 
tuple of permutations fi,---,fd '■ ^ S. An assignment A : V — > S strongly satisfies the 
constraint if /i(^4(^i)), . . . ,/d(^4(fd)) are all equal; an assignment weakly satifies the constraint it 
if /i(j4.(i>i)), . . . , fd{A(vd)) are not all different. 

The following result is a rephrasing of a result by Khot and Regev [KR03] . 



Theorem 2 If the unique games conjecture is true, then for every d and every 7 there is a a = 
o~(d, , y) such that, given a d-ary unique game with alphabet size a, it is NP-hard to distinguish the 
case in which there is an assignment that strongly satisfies at least a 1 — 7 fraction of constraints 
from the case where every assignment weakly satisfies at most a 7 fraction of constraints. 



We define the strong value of an assignment to a d-ary unique game as the fraction of constraints 
that are strongly satisfied by the assignment. The strong value of a unique game is the largest 
strong value among all assignments. The weak value of an assignment and of a unique game are 
similarly defined. Note that the weak value is always at least as large as the strong value. 



3 Influence of Product of Functions 



In this section we prove a bound on the influence of a function of the form f{x) := fi(x) ■ 
f2(x) ■ ■ ■ fk( x ) m terms of the influence of the functions fj. Such a bound will be very useful 
in the proofs of our main results. 

In the boolean case, the bound is just a simple union bound. 
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Lemma 3 Let fi,...,fk '■ {0, l} n — ► { — 1,1} be boolean functions, and define f{x) = f\{x) ■ 

h(%) ■ ■ ■ fk{x). 

Then, for every i G [n] 

Inf,(/)<^Infi(/,) (10) 

j 

Proof: Using the formula for the influence of boolean functions, we see that 
InU(f) = Pr[f(x)^f(x + e t )] 

X 

= Pr[/i(x) • • • f k {x) h(x + a) ■ ■ ■ f k {x + a)] 

X 

k 

i / x 

3=1 

□ 

For general real-valued functions, we cannot hope to achieve the nice bound of Equation 10. Sup- 
pose, for example, that n = 1 and that all functions fj are defined as follows: fjifi) = 1 — e, 

fjQ) = i. 

Then, we have h(fj) = \e 2 . When we define f(x) := Yij fj(x), we get /(0) = (l-e) k and /(l) = 1, 
and so = j(l — (1 — e) fc ) 2 , which is about \k 2 e 2 for small e, or about A; times the sum of the 
influences of the functions fj. The following Lemma achieves such a tight bound. 

Lemma 4 Let fi : {0, l} n — > [—1, 1] 6e functions, and define 

f(x) :=fi(x)---f k (x) 
Then, for every i £ [n], Inf»(/) < k ■ X) j ^U(fj) 

PROOF: We begin by proving the following claim: 



Vai, . . . ,a k ,h, . . . ,b k G [-1, 1]. 



< 



^2\ai-bi\ (11) 



We prove Inequality 11 by first expressing the right-hand side as a telescoping sum of a sequence 
of "hybrids," and then by using the triangle inequality and the fact that all and bi have absolute 
value at most 1. 



K • a 2 ■ ■ -a k - &i • 6 2 • • • h\ 
= | (en • a 2 ■ ■ • a k ) - {b x • a 2 ■ ■ ■ a k ) + (h • a 2 • ■ ■ a k ) - (&i • b 2 ■ a 3 ■ ■ ■ a k ) + (bi ■ b 2 ■ a 3 • • • a k ) - 

h (6i • • • • a k ) - (6i • • • b k )\ 

= \(a 2 ■ ■ -a k ) ■ (ai - b\) + (b\ ■ a 3 ■ ■ ■ a k ) ■ (a 2 - b 2 ) H h (&i • • • 6fc_i) • (a fc - 6 fc )| 

< \a 2 ■ • • afc| • |ai - fei| + |fei • a 3 • • • ajfc| • |a 2 - 62 1 H h |6i • ■ ■ fefc-i| • \a k - b k \ 

< \ai - 6i| + \a 2 — b 2 \ -\ h \a k - b k \ 
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If we square both sides and apply Cauchy-Schwartz, we get 

< k • ^^(oj — b. 



2 



To summarize our progress so far, we have proved the following claim: 

Vcii,..., a k , &!,..., b k G [-1,1]. (ri ai_ n &i ) ^ k ■ ^2 ( a i ~ b i) 2 

\ i i / i 

We are now ready to prove the Lemma. For every x, using Inequality 12, we have 

(/(*) - f(x + ei)f = (n fi(x) - J] Uix + eA < k ■ J2(fj(*) - fj(* + e,)) 2 

\ i i J j 

and of course the same inequality remains valid if we take the average over x, so we have 
Inf t (/) = \eU(x) - fix + e t )) 2 < fcI E ^(/,(x) - fjix + e,)) 2 < k^Mfj) 



(12) 



4 x 

j 



□ 



4 Low Influence Implies Small Gowers Uniformity 

We are going to show that, for balanced functions / : {0, l} n — > { — 1, 1}, if U d if) is large, then one 
of the variables of / has high influence. 

Lemma 5 Let f : {0, l} n — > {—1, 1} be a function and d > 1 be an integer. Then 

U d if) < U\f) + i2 d ' 1 - lJmaxMiC/) 

PROOF:The case d = 1 is trivial. Let d > 2, and define / := maxjlnfj(/). We will prove 

U d if) < U d ~ l if) + 2 d ~ 2 I (13) 

which immediately implies the statement of the lemma. 
We write 



U d if) = E 

Xl,...,X d _ 2 



E /* ll ...,* d _ a (s)/x 1 ,..., !Bd _ a (a; + y)f xi ,...,x d _ 2 {x + z)f xlr „ )Xd _ 2 ix + y + z) 



x,y,z 



a 
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= e e ,, a (0) 

Xl,...,Xd_2 , "L Xl,...,X d _ 2 

We separately bound the two terms in the last expression. 

For every ai, . . . , a^_2 £ {0, 1}™, using Lemma 3, we get that, for every i E a, Infj(/ aij ... >ad 2 ) < 
2 d_2 InL;(/) < 2 d ~ 2 I and so we have that, for every ax,... , cy_ 2 G {0, 1}", 

< ^/l„,a d _ 2 («) < maxlnfi(/ 01) ..., ad _ 2 ) < 2 d ~ 2 / 

and so 

H,...,a;d_2 

Regarding the other term, 

E /4,...,x d _ 2 (0) < E „ 2 (0) E fE/x 1 ,...,x d _2(^y = C/ d - 1 (/) 

Xl,...,X d _2 Xl,...,X d _2 X!,...,X d _ 2 \X J 

□ 

The same argument also applies to general bounded real- valued functions. 

Lemma 6 Let f : {0, 1}™ — > [—1, 1] be a function and d > 1 be an integer. Then 

C/ d (/)<C/ 1 (/) + 4 d maxInf i (/) 

Proof: Define / := maxj Infj(/). It suffices to prove that, for d > 2, 

U d (f) < U d ~\f) + 2 M - 4 maxInf i (/) (14) 

i 

We repeat the proof of Lemma 5 verbatim, except that we use Lemma 4 instead of Lemma 3 to 
get an upper bound for Infj(/ 01) ... j0(J _ 2 ). Because of the worse bound in Lemma 4, we only get the 
bound 

lnU(f ai _ ad _ 2 ) < 2 M ~ 4 Inf 4 (/) < 2 M ~ 4 I 
and the rest of the proof proceeds with no change, except for the term 2 M_4 instead of 2 d ~ 2 . □ 

We remark that our bound for the boolean case is nearly tight. 

Lemma 7 For every fixed d > 2 and large n, there is a function f : {0, l} n — ► { — 1, 1} such that 
U\f) = o n (l), U d (f) = 1 and max, InU(f) < ^ + o n (l). 

Proof: Consider the function 

f(x u ...,x n ):= (-1)^1^-^-1+^-^-2+- 

□ 
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5 Low Cross-Influence Implies Small Gowers Inner Product 



The main result of this section is that if a collection of functions has small cross-influence, then it 
has small Gowers inner product, provided that at least one of the functions is balanced. 



■1,1] such that 



Lemma 8 For d > 2, let {fs}sc[d] be a collection of functions fs '■ {0, 1} T 

• f[d] is balanced, that is, K x f\d]( x ) = 0; 

• 4XInfj({/s}) < e for every i. 

Then 

{{fs}) Ud <V~e-2°^ 



Before proving Lemma 8, we establish a variant of a result of Aumann et al. [AHRS01] which will 
be useful in the inductive step of the proof of Lemma 8. 



Lemma 9 For every four bounded functions fx, f%, f 3 , fi : {0, 1} T 



■1,1], 



E h{a)h(a)h{a)f4(a] 



< 4maxmin{|A(a)|, |/ 2 (a)|, \f 3 (a)\, |/ 4 (a)|} 



Proof: Let 



e := maxmin{|/i(a)|, |/ 2 (a)|, | /3 (a) 1,1/4 (ck) I } 



For i = 1, ... ,4, let Si be the family of all sets a such that |/i(a)| < s. By definition, the union of 
the families Si contains all subsets a C [n]. We can then write 



J2fi{a)f2(a)f 3 {a)f 4 (c 

a 

< ^\h(a)\-\h(a)\-\f 3 (a)\-\U(a) 



< EE l/i(«)|-|/a(a)l-lA(«)|-|/i(«)l 

< e(E • • + E iA(«)i • • iAh 

\aeSi a£S2 

+ E lA(«)l • • + E lA(«)| • |/ 2 («)| • |/ 3 (a) 

< 4e 
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The last inequality follows from the fact that for every three functions f,g,h : {0, l} n — ► [—1, 1] we 
have 

]T |/(a)||$(a)||£(a)| < £ |/(a)||$(a)| < / 2 (a) 5 2 («) < 1 

a a y a y a 

□ 

We proceed with the proof of our main result of this section. 
Proof: [Of Lemma 8] We want to prove 

{{fs})u*<T(e,d) (15) 

for a function r(e, d) = \fe ■ 2°( d > that we specify later. 
We proceed by induction on d. 

The case d = 1 

For d = 1, we have two functions fq,, /m such that Ex f{i}(x) = and we want an upper bound to 
({/sDf/i. We see that 

(h,f{i})m = (f/ (x)) • (e /{!}(!/)) =0 
We have proved the base case of Equation 15 with r(e, 1) = 0. 

The inductive step 

Suppose now that, for d > 1, the lemma is true up to dimension d, and we want to prove it for 
dimension d + 1. 

We have 2 d+1 functions {fs}sc[d+i} an d we want to upper bound the Gowers inner product 

<{MW= E n /s (* + !>) (16) 

x ' Xl '-' Xd+1 sc[d+i] \ Its ) 

For every x\, . . . , a^-i, define the four functions 4 

8C[d-l] V ieS / 

B Xl ,...,x d _ 1 {%) 



SfC[d-l] \ i£S 



4 The case d = 1 is somewhat degenerate: [d — 1] is the empty set, and xi, . . . , x^-i is an empty sequence. So we 
simply have A := fy, B := /{i}, C := /{2} and 7? := /{i,2}, with no subscripts. 
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C Xl ,..., Xd _ 1 {x) := Yl fsu{d+i} I x + Yl Xi ) 
sc[d-x] V ies / 

■= n fsu{d,d+i} [ x +^z x i) 



with this notation, we can rewrite the expression (16) as 



({fs})u d + 1 = E ^i,...,x d _ 1 (^)^i,...,x d _ 1 (^+Sd)C Xli ... iXd _ 1 (x+x d+ i)I) Xli ... iXd _ 1 (x+j; d +x d+ i) 

x,a;i,...,a! d+1 



and using the Fourier expansion and simplifying, 



{{fs})u*+i = E V 4,..,^! (a)4 lr .. A.ja)^,...,^! a-i ( a ) 

Slj.-.jXd-i — 



E V ix!,...^.! (a)-B Xl! ... )a:d _ 1 (a)^,...,^.! (a)D xu ... tX (c 
+ E i, 1 ,...,x d _ 1 (0)5 Xl ,...^_ 1 (0)4 1 ,..., a:ti _ 1 (0)^ 1 ,...,, d _ 1 (0) 

Xl,...,X d _ 1 



We bound the two terms separately. 

For the first term, we have that, for every fixed x = (x±, . . . , Xd-i), 

< 4maxmin{|i x (a)|, |S x (a)|, |Cx(a)|, |Ac(a)|} 

< 4^maxmin{Infi(A x ), Infi^x), Infi(C x ), Infi(D x )} 

And we observe that 

mm{lnU(A x ),Inii(B x ),lnU(C x ),lnU(D x )} < 2 2d ~ 2 e (17) 

by using Lemma 4. To verify this claim, 5 let 5 be the minimum in the above expression. Then 
Infi(.Ax) > <5; recall A x is defined as a product of 2 d ~ 1 functions of the form fs(x + J2s Xi )' 
S C [d — 1], where we think of the Xj as constants, and so for at least one S C [d — 1] we have 
Infj(/s) > 5/2 2d ~ 2 . We argue similarly for £> x , C x and D x , and we find three more functions for 

5 Again, the d = 1 case is degenerate but easy to check: Equation 17 reduces to 

iniii{Infj(/( ! |),Infi(/ {1 }),Iiifi(/ {2} ),Ii>fj(/ {li 2})} < e 

which is precisely our assumption that the 4-cross influence of the functions is at most e. 
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which coordinate i has influence at most 5/2 2d 2 , and these four functions are distinct. Therefore, 
e>4XInf,({/ s }) >5/2 2d ~ 2 . 

So we have that, for every fixed x = (x±, . . . , Xd-\) 

^i x (a) J B x («)C x (a)D x (a) < aV^^e 

and so, in particular 

eY A x (a)B x (a)C x (a)D x {a) < 4V2 2d ~ 2 e 

x — ' 

The second term can be written as 

Eix(0)^x(0)C , x(0)£'x(0) 

X 

= E A x (x)B x (x + y)C x (x + z)D x (x + w) (18) 

x,x,y,z,w 

< E A x (x)B x {x + a)C x {x + b)D x {x + w) (19) 

where a, b are values for y, z that maximize the expectation in Equation 18. 
If we expand the Expression in (19), we get 



(20) 



Ex,x lr .. A _ lim IIsc[d-l] fs {x + Ei 6 5 X i) fsu{d} {x + a + £ igS Xi) 

/su{d+i} (a; + b + Eies x *) /su{d,d+i} {x + w + X) ie5 

which we are going to re-write as the d-dimensional Gowers inner product of a new set of functions, 
so that we can invoke the inductive hypothesis. For every S C [d — 1], define 

9s(x) := fs(x) ■ fsu{d}( x + a) ■ f S u{d+l}{x + b) 

and define 

9su{d}(x) ■= fsu{d,d+i}( x ) 

Expression (20) becomes 



E 

X,Xl,...,X c l_i,W 



n 9s I x + ^2 Xi J • g SU {d} U + ^Xi + w] 
sc[d-i] V es J \ ies J 

E [ gs Ix + y^Xi) = ({gs}}ui 

u - Xd - 1 ' Xd seta V es / 



after the change of variable w — > x^,- 

By definition, the function gy\ = fyd+i] is balanced, and, by construction and by Lemma 4, the 
4-cross-influence of the functions gs is at most 9e. So we have U d ({gs}) < t (9e,d). 
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We have thus solved the case of dimension d + 1, with 

r(e, d + 1) = 2 rf+1 • + r (9e, d) 

Together with the base case 

r(e, 1) = 

the recursion gives r(e, cZ) = 2°^ • yfe. □ 

6 Generalizing to Complex- Valued Functions on Arbitrary 
Groups 

In this section we generalize our result to the setting of functions / : G\ x • • • x G n — > C where 
each Gi is a finite abelian group. 

We fix a group G = G\ x • • • x G n for the rest of this section. 

We write group operations in G and in the groups Gi additively. We will recover the results of the 
previous sections when each Gi is Z2. We denote an element of G as a tuple g = (g%, . . . ,g n ) where 
gi is an element of Gi. The zero element of G is the tuple = (0, . . . , 0). 

If z = a + bi is a complex number, we define its conjugate z := a — bi and its absolute value 
\z\ := V a 2 + b 2 = V 'z -z. 

6.1 Complex Random Variables 

If X is a random variable that takes on finitely many complex values, and n(x) is the probability 
that X takes value x, then the average of X is the complex number 

E[X] :=^/i(a;)-x 

and the variance of X is the real number 

\ar[X] := E[\X - E[X}\ 2 } . 

6.2 Fourier Analysis 

A function x '■ Go — > C is a character of a group Go if x(0) = 1 an d x( a + 6) = x( a )x(^) f° r every 
a,b £ Go. We also have x(~ a ) — x( a )- 

It is well known that a finite abelian group Go has precisely |Go| characters, and that there is an 
isomorphism between Go and the set of characters that associates to each group element g precisely 
one character \g so that Xo is the constant 1 function and XgXh = Xg+h- 

For each group Gi, let {x g }geGi be the characters of Gj, indexed according to the above isomor- 
phism. 

For each g = (g±, . . . , g n ) define the function Xgi,...,g n '■ G ^ C 
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Xgi,...,g n { x li ■ ■ ■ i x n) '■— Xg;( 



iefn] 



Then the functions Xg are the characters of the group G. 
Define the following inner product among functions /, h : G — > C: 



(f,h) :=Ef(x)h(x) 

X 

Then one can verify that the functions Xg are orthonormal with respect to the inner product, and 
form a basis for the space of functions / : G — > C. Every such function can be written as 



gee 



where /(g) = (f,x s )- 

We again have Placherel's identity 



E|/(x)| 2 = ^|/(g 



g 



and we observe that 



/(0)=E/(x) 



6.3 Influence of Variables 



The following definitions could be given in a much more general setting, but the following will 
suffice for the purpose of this paper. 

Let Si, . . . , E n be finite and / : Si X ■ ■ ■ x E„ -> C be a function, then the influence of i for / is 
defined as 



Infi(/) := E 

3?l,...,Xj_i,X a; -|_l,...,X7i 



Var[/(xi, . . . ,x n) 



where the variables Xi are mutually independent, and each Xi is uniformly distributed over Sj. The 
reader should verify that if each Sj = {0, 1} and if / takes only real values, then we recover the 
definition we gave earlier. Note that the influence is always a non-negative real number. 

Returning to our setting of functions / : G\ x • • • x G n — > C, we have the following fact: 

Inf i (/)= ]T \f( gi ,...,g n )\ 2 

(9iv,9n):ft^0 

and, in particular, 
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maxlnfj(/) > max|/(g)| 2 

i g^O 

6.4 Influence of Products of Functions 

The following result is the only real difficulty in generalizing our results from previous sections. 
(The rest just follows the same proofs with some changes in notation.) 

Lemma 10 Let f, g : Si x • ■ • x E n — ► C be functions such that \f(x)\ < 1 and \g(x)\ < 1 for every 
x. 

Then, for every j £ [n], 

lnU{fg)<Z-{lnUU)+^U{g)) 
To prove the lemma, it is enough to prove the following bound 



Lemma 11 Let f, g : £ 

Then, 



C be functions such that \f(x)\ < 1 and \g(x)\ < 1 for every x. 
Var[/(s) 5 (x)] < 3 • (Var[/(x)] + Varfo(x)]) 

X XX 

Proof: Recall that Var x [/(a?)^(x)] := K x \f(x)g(x) - E y [f(y)g(y)}\ 2 - For every x, 



f(x)g(x)-E[f(y)g(y)] 
y 



f(x)g(x) - E[f(y)]g(x) + E[f(y)]g(x) - E[f(y)g(y)] 

y y y 



g(x) ■ f(x) - E[f(y)} + E [f(y) ■ (g(x) - g(y))] 



< \g{x) 



f(x)-E[f(y)] 



+ E[\f(y)\\g(x)-g(y)\] 



< 



y 



+ E[\f(y)\\g(x)-g(y)\} 
y 



We can now compute the variance of fg: 



Var[f(x)g(x)} := E 

X X 

< E 

X 

< 2E 

X 

= 2E 



f(x)g(x)-E[f(y)g{y)] 



f(x)-E[f(y)} 



+ E[\f(y)\\g(x)-g(y)\) 



f(x)-E[f(y)} 



f(x)-E[f(y)} 



+ 



E[\f(y)\\g(x)-g(y)\] 



+ 2EE [\f(y)\ 2 \g(x)-g(y)f 

x y 



< 2Var[/(x)] +4Var[ 5 (x)] 
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Where the last step follows from the fact that 



E \g{x)-g(y)\ 2 = E g{x) - E[g{z)} + E[g(z)} - g{y) 

x,y x,y z z 

Similarly, we could prove 



< 2E 

X 



g(x) - E[g(z)] 



+2E 

y 



E[g(z)} - g(y) 



Vax[f(x)g(x)] < 4Var[/(x)] +2Var[ 5 (x)] 

X XX 

And the average of the two bonds gives us the desired result. □ 
A simple induction shows that 



ln{ i (f 1 ---f k )<3-k lo ^ 3 -J2 1 ^i(f j ) 



6.5 Gowers Uniformity 

The definition of U d for complex-valued functions is as follows: for d > 1 and / : G — ► C, 



U d (f) := E 

X,Xl,...,X d 



K SQ[dUS\ 



II /(*+£*))■( II t(*+Y,«) 

US\ even \ ieS / / \5C[d],|S| odd \ i£5 / 



The inductive definition of U d is perhaps simpler. For a function / : G 
Xi, . . . , Xd E G, define f xi ,...,x d '■ G — > C inductively as follows: 



and elements 



fxi,...,x d (x) '■— fxi,...,x d ^i( x )fxi,...,x d -i{x-\-X c l) 



and then define 



U d (f) := E rM) 

x,x 1: ...,x d 

It is possible to show that U d (f) is always a non-negative real number, because we have 



U d {f) = E fx 1 ,...,x d _ 1 {x)fx 1 ,...,x d . 1 (x + x d ) 

IE (E/x 1 ,...^ d _ 1 (a;)) (E (a:)) 

Xl,...,Xd-l \x J \x J 



E 

X 1 ,...,X d - 1 



Ef xi , 



Explicit formulas for the case d = 1, 2 are: 
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U\f) = E f(x)f(x + y) = (E/(x)) -E/(x)) = |E/(x)| : 

xx x 



and 



f/ z (/)= E f(x)f(x + y)f(x + z)f(x + y + z) = V /(g)/(g)/(g)/(g) = ^ |/(g)| 4 
6.6 Gowers Uniformity and Influence 

Theorem 12 Let / : G\ x • • • x G n — > C 6e a function such that \f(x)\ < 1 /or ei>er?/ x. Then, for 
every d > 1, 

U d (f)<U 1 (f) + 2°W maxMi(/) 

Proof: The case d = 1 is trivial. For d > 2, it suffices to prove 

£/ d (/) < + 2 cd maxlnf i (/) 

i 

for an absolute constant c. Let I := maxj Infj(/). 
We follow the proof of Lemma 5 and write 



U d (f) = E 

Xl,...,X d _ 2 



E f xi ,...,x d _ 2 {x)f xi ,...,x d _ 2 {x + y)f xi ,...,x d _ 2 (x + z)f xl ,..., Xd _ 2 {x + y + 



x,y,z 

4 



Xl,...,X d _ 2 

g 



e Ei/^.-,^- 2 (g)i 4 + e liw^wr 



Xl,...,X d _ 2 *— ' Xi,...,X d - 2 

We separately bound the two terms in the last expression. 

For every a\, . . . , dd~2 €E G, and every g = (fli, • • • , g n ) / 0, using Lemma 10, we get that, for every 
i : gi 7^ 0, Infj(/ air .. )ad _ 2 ) < 2 cd Infj(/) < 2 cd I for an absolute constant c, and so we have that, for 
every ai, . . . ,a d _ 2 G G, 

£ l4,.,«,- 2 (g)| 4 < max|/ ai ,..., ad _ 2 (g)| 2 < maxlnf 4 (/ ai ,..., ad _ 2 ) < 2 cd / 

g^O 

and so 

xi,...,x d _ 2 ~f 

Regarding the other term, 

2 



E |/, 1 „.„ a;(i _ 2 (0)| 4 < E \fx u ...,x d _ 2 (0)\ = E 

Xl,...,X d _ 2 Xi,...,X d _2 Xl,...,X d - 2 



E f Xl ,...,x d _ 2 (x) 



u d -\f) 



□ 
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6.7 Gowers Inner Product 

Let {fs}sc[d[ be a collection of functions fg : G — > C. Then their Gowers inner product is the 
complex number 



({fs})u* ■■= IE 

X,Xl,...,X d 



SC[d]:\S\ 



\:\S\ even V iSS / 5C[d]:|5| odd V ieS 



(21) 



6.8 Gowers Inner Product and Cross-Influence 

We generalize Lemma 8 to the case of products of arbitrary groups. 

Lemma 13 For d > 1, let {fg}gc[d] be a collection of functions fg-GiX---x G n — > C such that 

• \fs(%)\ < 1 f or every S and every x; 

• f[d] i s balanced, that is, ~E X fu] (%) = 0; 

• 4XInfj({/s}) < e for every i. 
Then 

\{{fs})uA<V~e-2°^ 

We give an outline of the changes needed to adapt the proof of Lemma 8. 

First, we need the following bound, whose proof is identical to the proof of Lemma 9. 

Lemma 14 Let /i , /2 , / 3 , / 4 : G — > C be functions such that \fi(x)\ < 1 for every i and every x. 
Then 

£ lA(8)l • |A(S)I • l/s(8)l • 1/4(8)1 ^ 4maxmin{|A(g)|, |/ 2 (g)|, |/ 3 (g)|, |/ 4 (g)|} 
g 

As in the proof Lemma 8, we proceed by induction on d and prove that, under the hypothesis of 
the lemma, ({fg})uA < T~(e,d) for a function r that satisfies r(e,d) = y/e2°^. For d = 1, We 

have (fdf, = (Ex fd( x )) ' ^Ey /{i}(y)) — 0, and so we have the base case with r(e, 1) = 0. 

For the inductive step, we consider the {d + l)-dimensional Gower inner product 



{{fs})u*+* = E 

x,x 1 ,...,x d+1 

It is easier to rewrite it as 



n /* u+5> • 

SC[d+l]:|5| even V «6S / 5C[d+l]:|5| 



n fsU+^xA 

■11:151 odd V ieS 1 / 



(22) 



({/sLW = E 

X,Xl,...,X d+ i 



n ^(*+i>) 

5C[d+l]:|S| i€S / 



(23) 
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where we define F$ := fs if \S\ is even and F$ := fs if \S\ is odd. 
For every x\, . . . , x^-i, define the four functions 



^,...,^(2;):= Yl Fslx + ^Xi) 

SC[d-l] V ieS / 



D 



Xl,...,X d - 



i0) : = JJ F su{d} I x + £ Xi J 
sc[d-i] ieS / 



^1 1 ■ ■ -i^d- 



D 



Xl,...,Xd 



-i( x ) : = n F su{d+i} ix+^xi 

SC[d-l] V ieS y 

-1 0*0 : = II F 5U{d,d+l} {x + ^Xi 

sc[d-i] V ieS 1 



with this notation, we can rewrite the expression (23) as 



x,a:i,...,a: [ j + i 

and using the Fourier expansion and simplifying, 



\({fs}) 



E V^xCg^feAfejAcCg) 

x=(a;i,...,x d _ 1 ) 



g 



< 



E Mu)B x (g)C x (g)D x (g) 

x=(xi,...,x d _i) 



g^(0,...,0) 



+ 



E A x (0, . . . , 0)B X (0, . . . , 0)C X (0, . . . , 0)D X (0, . . . , 0) 

x=(xi,...,a: d _ 1 ) 



We bound the two terms separately. 

For the first term, we have that, for every fixed x = (xi, . . . , x^i) 



£ ^x(g)5 x (g)c x (g)D x ( g ) 

g^(o,...,o) 



< ]T |ix(g)|-|^x(g)|-|C x (g)| -|Ac(g)| 

g^(o,...,o) 

£ |ix(g)|-|Sx(g)|-|Cx(g)|-|^x(g)| 
g^(0,...,0) 

< 4 / max min{Inf • ( A x ) , Inf ; (B x ) , Inf • (C x ) , Inf j (D x ) } 
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And we observe that, for every x = (x±, . . . , x<i-i) and every i, 



iniii{Iiif i (A x ),Iiif i (B x ),Iiif i (C' x ),Iiif i (Z) x )} < 2 cd e 

for some absolute constant c. 

The second term can be written as 



E4(0,..-, 0)5 X (0, . . . , 0)C X (0, . . . , 0)D X (0, . . . , 0) 



E A x (x)B x (x + y)C x (x + z)D x (x + w] 

x,x,y,z,w 



(24) 



< E 
y,z 



< 



E A x (x)B x (x + y)C x (x + z)D x (x + w) 



E A x (x)B x (x + a)C x (x + b)D x (x + w) 



(25) 



where a, 6 are values for y, z that maximize the expectation in Equation 24. 
If we expand the Expression in (25), we get 

E x , Xl ,..., Xd _ 1<w Usc[d-i] F s ( x + EiGS x i) F su{d} {x + a + x i) Qg) 

Fsu{d+i} {x + b + Yl ieS x i) F su{d,d+i] {x + w + Y ieS x^ 

which we are going to re-write as the d-dimensional Gowers inner product of a new set of functions, 
so that we can invoke the inductive hypothesis. For every SC [d — 1], if S is even, define 



g s (x) := F s (x) ■ F Su{d} (x + a) ■ F Su{d+1} (x + b) 



and define 

If S is odd, define 

and 



9su{d}i x ) ■= F SU {d,d+i}{x) 



g s (x) := F s (x) ■ F Su{d} (x + a) ■ F Su{d+1} (x + b) 



9su{d}i x ) ■= Fsu{d,d+i}( x ) 
Expression (26) becomes {{gs})u d after the change of variable w — > x d . 

By definition, the function grji is either fu+i] of fu+i] and, in either case, g\& is balanced. By 
construction and by Lemma 10, the cross-influence of the functions gs is at most c'e for an absolute 
constant c'. 

This proves the inductive step with r(e, d + 1) = 4 • V2 cd e + r(c'e, d — 1) where c, c' are absolute 
constants. Together with r(e, 1) = we have r(e, d) = 2°^y^e as desired. 
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7 A Tight Analysis of Linearity Testing 



Consider the following promise problem. Given a function / : {0, l} n — > { — 1, 1} and a small e > 0, 
we want to distinguish the two cases 

1. / is linear; 

2. U d {f) < e. 

We refer to such a test as a "relaxed linearity test of degree (d — 1)." As usual, we say that a 
test has completeness c and soundness s if the test acceptes with probability > c in case (1) and 
with probability < s + s' in case (2), where ef — > when e — > 0. If a test makes q queries and has 
soundness s, then its amortized query complexity is q = (//logs -1 . 

For d = 2, this problem is the linearity testing problem. For d = 3, the only functions such that 
U s (f) > £ are functions that are correlated with degree-2 polynomials [Sam05], and so the test 
is required to distinguish linear functions from functions that are far from being quadratic. For 
d > 4, it is conjectured that the only functions with U d (f) > e are those that are correlated with a 
degree-((i — 1) polynomial, and, if so, such a test distinguishes linear functions from functions that 
are far from low-degree polynomials. By our results, such a test also distinguishes linear functions 
from functions where all variables have low influence. 

We give a tight analysis of the error probability of such tests for a given number of queries. 
7.1 The Linear Gowers Inner Product 

For the sake of our analysis of the Hypergraph Test, it is convenient to study expressions of the 
following form. Let {fs}sc[d\ be a collection of 2 d functions f s : {0, l} n -> [-1,1], d > 1, and 
define their linear Gowers Inner Product as 



({fs})LU* ■■= 




As usual, an empty sum is zero. For example: 



</0,/{i}>£[/i := E/ (O)/ {1} (x) 

(/0,/{1},/{2},/{1,2})lC/2 != E ft$(0) f {1}( X ) f {2} (V)f {1,2} ( x + y) 

Where = (0, ... ,0) is the all-zero vector of {0, 1}". We call it a linear inner product because, 
for functions fs ■ {0, l} n — > K, the Gowers inner product is defined by picking at random an affine 
subspace of {0, l} n of dimension d, and then taking the product of the functions on all points of 
the subspace. In the above expression, we do something similar but on a linear subspace. 

We prove that if the linear Gowers inner product of a collection of functions is large, then the 
regular Gowers inner product of a related collection of functions must also be large. 
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Lemma 15 Let fs '■ {0, l} n — > [—1, 1] be functions, S C [d], and define the collection {(/tItcm as 
9T = hu{d} ■ Then 



l({MWI<VW 



Proof: 



\({fs}) 



LU d \ 



< 



< 



u '"* d sc[d\ \ieS I 



E 

\ Xl,...,Xd-l 



n fs[j2 xii 

,SC[d],dgS \ieS 



\ 



E 



;X d _! \ X 



e n M*+ e 



E /ff [ V + £ x< 



SC[d],deS 



ieS-{d} 



sc[d],des 



ieS-{<2} 



E 

x,y,xi,...,xd- 



1 TC[d-l] \ ieT / \ i£T ) 



after the change of variable y — > x + x&. □ 



Lemma 16 Let fs ■ {0, l} n — > [—1,1] fre functions, S C [d], swc/i i/iai ({/s})z,[/<* > e arad 
Ex / M (ac) =0. 

JTien i/iere is a variable i such that XInfj({/s}) > e 4 /2°( n ). 
Proof: Define gx '■= fru{d}- Then we have 

1- Ex9[d\(x) = Exf[d](x) = 0, 

2. ({g T }) ud > «{/*} W) 2 > e 2 - 

From Lemma 8 we derive that there is a variable i such that 4XInf j({<?r}) > E 4 /2°( rf ) . Each function 
fs occurs at most twice in the collection {gr}, and so it must be Xlnf^d/g}) > 4XInf i{{gr}) > 
e 4 /2° (d) . □ 
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7.2 Positive Results on Relaxed Linearity Testing 

Given a hypergraph H = (jk],E), we can define a relaxed linearity test associated with H by 



iJ-Test 

choose uniformly at random in {0, 1}" 

accept if and only if 

Then we have the following result. 

Theorem 17 Let d > 2 and let H = ([k],E) be an hypergraph such that each edge of H contains 
at most d vertices. 

Them the H-Test is a degree-(d — 1) relaxed linearity test of completeness 1 and soundness at most 
l/2l E L 

We remark that this result was first proved in [Sam05], using a different approach. 

For the proof of Theorem 17 and of results in the next section, it will be convenient to use the 
following "Vazirani XOR Lemma," whose proof is immediate. 

Lemma 18 Let X\, . . . ,X m be random variables taking values in { — 1, 1}. Then 

Pr[Xx = 1 A X 2 = 1 A • • • A X rn = 1] = ^ JJ^ 

SC[m] i&S 

Using Lemma 18, we see that the probability that the H-Test accepts a function / is equal to 

i£n(n /<*'))• /(!>') 

E'CEeeE Vise / Vise / 

=m + w\ ^ nfn /(*•>)■ /(£*•) 

E'CE,E'^e&E VtGe / XiSe / 

We will also need two results from [GowOl, GT04]. The first one is [GowOl, Lemma 3.8], and it 
states that for every collection {fs} of functions: 

({f S })m\<U{ ud (fs)) 1/2d 
s 

The other is [GT04, 5.7] and states that for every / and d > 2, 

U d -\f) < yju*tf) 
We now proceed with the proof of Theorem 17. 

Proof: [Of Theorem 17] It's clear that a linear function is accepted with probability 1. 
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If the H-Test accepts with / probability more than 1/2^1 + e, then there is a non-empty E' C E 
such that 

n (n -fin A ^ e ( 2? ) 

e€E' Vise / Vi6e / 

Let d! be the size of the largest edge in E' , and, without loss of generality, assume that the edge 
(1, . . . , d!) is in E' . Fix the variables x d +1 , . . . , x k to values that maximize (27). Then, (27) becomes 

n /s(e*.W 

SC[d'] \ieS / 

where /g is the constant function equal to the product of the terms of (27) that depend only on 
x d +1 , . . . , x k , /{i}(x 1 ) is the product of the terms of (27) that depend only on x l and x d +1 , . . . , x k , 
and so on. In particular, fr d n (x 1 + • • • + x d ') = f{x 1 + • • • + x d> ). 
We thus have 



{{fs}) LUd > > e 

By Lemma 15, there are functions {gs} such that gun = f and ({gs})jjd' > £ 2 ■ 

Since all the functions involved are boolean, and thus their uniformity norms are at most 1, we 
conclude 

({gs}) ud , < min|[/ rf '( 55 ) < |Z7* (/) |V* 



e 2 < 



s 



d-d' 



And, finally, U d (f) > [U d ' '(/)) > e 2d+1 . □ 

ergraph on k vertices with at mos 

yd (k\ ■ 



In particular, taking H to be the complete hypergraph on k vertices with at most d vertices per 
edge, leads to a hypergraph linearity test with Yli=i (i) queries, and soundness — d 1 . The 



amortized query complexity of this test, for k S> d, is q < 1 + O ^ g(<i _ 1) / d 
Next, we show that no linearity test can do better. 

We do it in two steps. The first is to show that the amortized query complexity of any hypergraph 
test cannot be better than 1 + $7 (^ q ( d -i)/d ^j f° r Q queries. 

The second step builds on the first, and shows that any non-adaptive linearity test with perfect 
completeness cannot do better under this promise, namely it will have amortized query complexity 
of at least 1 + ( ^(d^Ttfd ) f° r Q queries. 



7.3 Lower Bound for the H-Test 

We prove a lower bound for the H-Test by describing an explicit function / : {0, l} n — > {—1, 1}, 
which has small d-th uniformity norm, and for any hypergraph H the acceptance probability of the 
if- Test on / is large. 
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Proposition 19 Fix d > 2 and fix an hypergraph H = (\k\,E) such that all edges have at most d 
vertices. For n a multiple of d, let f : {0, 1}" — ► { — 1, 1} be defined as 

f{x x , ...,x n ):= i \) ri ^ i'"-''^ 

Then U d (f) < 2~^( n ) ; and the H-Test accepts f with probability at least 

max jl/2 |£;| , 2-^=2©} 

The lower bound on amortized query complexity of hypergraph tests follows immediately, if we 
recall that the number of queries is \E\ + k, and consider the two cases: \E\ < Yli=2 (i)' or 

|£|>£i 2 (•)• 

Proof: We first show U d (f) to be small. 

Some additional notation: Let gi{x) = • • • an d let g(x) = x\X2---Xd+ 

%d+l • • • %2d H = Y!i=o~ l 9i- Let fi = {-l) g \ 

Then / = nrio'Vi and U d (f) = U?li~ X U d (fi) = {U d (f )) n/d . Thus it remains to show that 
U d (fo) is bounded away from 1. 

U d (f ) = U d ((-1) 90 ) = E (-l)£sc M 3o(*+£ l6S ^) 

XjX 1 ,...,x d 

We may assume that the variables x % live in {0, l} d , and then go is just the AND function. Therefore 
^2sc[d] 9o { x + Sie5 xl ) coun ts the number of times the complement 1 — x of x is representable as 
a linear combination of x 1 ...x d . This number is odd (= 1) iff x l ...x d are linearly independent and 
is even otherwise. Thus 

U d (fo) = 1 — 2Pr[x 1 ...i d are linearly independent] 

and this is easily seen (and well-known) to be a positive constant bounded away from 1. 

The proof that the acceptance probability of / is high closely follows the proof of Proposition 15 
in [ST00]. That proposition is a special case d = 2 of Proposition 19. We will repeat parts of the 
proof since many definitions need to be modified, and for completeness, but will omit proofs of 
intermediary steps if they are similar. 

First, the probability of / to be accepted is 



nn xy 

Te5ieT VieT 



(28) 



In order to simplify this expression, we need to introduce some notation. Let J- = {{1}, {&}} U 
E := {F\...F q ~\, be a family of all the vertices and the edges of H, viewed as subsets of {1, ... , k}. 
Let A be a k x q zero-one matrix whose q columns are given by Fi...F q , which we view as 0, 1 
vectors of length k (in particular, the first k columns of A form the k x k identity matrix). Let ut, 
for T £ E be a zero-one vector of length q which is 1 if Fi is either Tor a singleton, corresponding 
to a vertex than T passes through; and otherwise. 
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For K C 2W, let E (/,ft) := E^,...,^ [EUt* / (Eie** 1 )] ■ 

Let £/ = Span (ut)x£E ^ e a i-dimensional subspace of 7^ then, for a boolean /, (28) is 



SCE X '•••' a: 



n / e 



1 

2* 



u={u{l),...u(q))&U 
1 

2* 



i E 



UT (i)=l 

n /(£*' 

i : u(i)=l UeFj 



^ :u(i) = l}). 

u=( u (i),...«( g ))ef/ 



(29) 



We will show many of the terms E(/, {-^i : u(i) = 1}) are 1. 



Definition 20 A family TZ C 2^ is an "even cover", iff every subset T C [fc], |T| < d is covered 
an even number of times by the sets R £ 1Z. 

The proofs of the following three lemmas are easily adaptable from the proofs of the corresponding 
statements (Lemmas 17,18,19) in [STOO]. 

Lemma 21 Let TZ C 2^1 If TZ is an even cover, than E(f,TZ) = 1. For any TZ, E{f n ,TZ) > 0. 
Proof: Omitted. □ 

Lemma 22 The number of vectors u £ U , such that the family TZ = {Fi : u(i) = 1} is an even 
cover, is at least max |l, 2*~^ 4=2 

Proof: Omitted. □ 



Lemma 23 



i < |E|. 



Proof: Omitted. □ 

Proposition 19 now follows from lemma 21, lemma 22, 
lemma 23 and (29). □ 
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7.4 Lower Bounds for Arbitrary Test 



We now pass to our more general result, that holds for any test, including tests with completeness 
smaller than 1. Recall that a test making g-queries is called non-adaptive if it makes the q queries 
simultaneously, instead of using the answer to the first query to decide how to make the second 
query, and so on. 



Theorem 24 let T be a non-adaptive test that makes q queries and that is a degree (d — 1) relaxed 
linearity test with completeness c and soundness s for functions f : {0, l} n — ► {—1, 1}. 

Then 



l-c + s>2 W d ' 1)/d , 
Equivalently, the amortized query complexity of T is at least 1 + ft Q (d J L 1) y d ^ . 

Proof: We will show that (1 — c) + s is at least e = ejj — 2~ n ( n \ where Eh is the best error 
achievable by a hypergraph test after q queries. The theorem will then follow from Proposition 19. 

By von Neumann's minimax theorem (also known as Yao's pronciple) it is enough to construct two 
distributions P and Q on boolean functions, P supported on linear functions and Q on functions 
with small U d norm, such that such that for any subset X = {x 1 ...x q } of the boolean cube the distri- 
butions P' and Q' induced by P and Q on {0, l} q by evaluating a function / ~ P (correspondingly 
/ ~ Q) on X are at most 1 — e' apart. 

For a function / : {0, l} n — ► R, and annxn matrix A over GF2 let f A be given by fA( x ) = f{Ax). 
Now, fix a non-zero linear function t, and let / = f(x\, . . . ,x n ) := (— \Y 1X2 "' Xd+Xd + 1 "' X2d+ "' . The 
distribution P is taken to be uniform over the functions {£a '■ A is invertible} and the distribution 
Q is taken to be uniform over the functions {/a : A is invertible}. 

Observe that P is supported on linear functions. In fact, P is uniform over all non-zero linear 
functions. On the other hand / has a small U d norm, by Proposition 19. The following lemma 
shows the same for the functions f^- 

Lemma 25 For a function f : {0, l} n — ► R, and an invertible n x n matrix A over GF2 holds 

U d (f) = U d (f A ) 

Proof: 

U d (f A )= E . IT f A [x + Yy) = E \\f[Ax + Y,Ax l 

i&S 

e d n /U+E x *) = ud ^ 

x > xl '-' xd SC[d] V i&S J 

The third equality follows from the fact that if a (d+ l)-tuple (x, x 1 , ...,x d ) is distributed uniformly 
in {0, l} n ( d+1 ), then so is the (d + l)-tuple (Ax, Ax 1 , Ax d ). □ 
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Since the distributions P and Q are invariant under invertible linear transformations, the induced 
distributions P 1 and Q' are determined by the linear structure of X, namely linear dependencies 
between x l ...x q . Let the linear rank of the vectors x 1 ,...,^ 9 over GF2 be k. We will assume 
x 1 , . . . ,x k are linearly independent, and x k+l = X^=i a ij x ^ f° r * = 1, . . . ,q — k. 
In the remainder of the proof we assume n 3> k and call an event negligible if its probability is 
exponentially small in n. 

The distributions P' and Q' are almost precisely modelled by the following experiments: Choose 
y 1 ,...,y k independently at random, set y k+l = X^=i a «j2/' J > for i = 1, . . . ,q — k, and return 
£(y 1 ), . . . ,£(y q ) or f(y 1 ),---,f(y q ) correspondingly. The only caveat comes from the negligible 
event that y 1 , ...,y k are linearly dependent. 

For n 3> k, the distribution P' is, up to a negligible probability, given by choosing the first k bits 
uniformly at random, and setting the other q — k bits according to specified linear dependencies. 

Consider the distribution Q'. By proposition 19 the probability that for i = 1, . . . , q — k holds 
f{y k+l ) = Ylj=i a ijf{v^) i s a * least ejj. Observe that in this case the g-tuple we deal with has no 
linear contradictions and thus belongs to the support of P' . Call such g-tuple "linear". We have 
just proved that, up to a negligible factor, 

Pr{z : z is "linear"} > ejj 

We have 

W P ' ~ Q'W = \ ■ E - = \ ■ E [max{P'(z),Q'(z)} - mm{P> (z) , Q' (z)}] < 

z z 

l-i.^min{P'(z),Q'(z)}<l-i. £ mm{P'(z), Q'(z)} 

z "linear" z 

To complete the proof of the theorem we observe that, up to negligible factors, P'(z) > 2~ k for all 
"linear" z, and 



Q'(z) = Q'{z u ...,z q )< Pr{/(yi) = z u f(y k ) = z k } = [JPr{/( yi ) = Zi } = T 

8=1 

)re 

mm{P'(z),Q'{z)} > P'{z) - 2~ n ^ >e H - 2~ n ^ 



"linear" z "linear" z 

□ 



8 The PCP Construction 
8.1 The Long-Code Test 

We say that a function g : {0, l} n — > {—1, 1} is a codeword of the Long Code (or, simply, is a long 
code) if there exists a coordinate i such that g{x\, . . . , x n ) = (— l) Xi , that is, if g = X{i}- Note that 
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if g is a long code then there is a coordinate that has degree-1 influence 1 for g. This is the extreme 
case of large low-degree influence for a bounded function. 

Given a collection of K balanced functions gi,...,gx : {0,1}™ — ► {—1,1}, we are interested in 
designing a test that distinguishes the following two cases: 

• The functions gj are all equal to the same long code, that is, for some i S [re] and for all 
je[K], 9j (x) = (-ir; 

• The degree-<i cross-influence of the collection gi, . . . , gx is less than than 5, for some small 5 
and large d. 

More formally, we have the following definition. 

Definition 26 (A'- Function Long Code Test) A test that is given oracle access to K functions 
gi,...,gK : {0,1}™ — > { — 1,1} is said to have soundness s and completeness c if the following 
conditions hold. 

• // the functions gj are equal to the same long code, then the test accepts with probability > c; 

• For every e > there is a r = r(e) > and d = d(e) such that if the test accepts with 
probability > s + e, then there is a variable of degree-d cross-influence at least r for the 
functions gj. 

Let H = ([t], E) be a hypergraph on t vertices. 

For < 7 < 1/2, define the distribution // 7 over {0, 1}™ so that M 7 (x) = ^ w ( x \l — ^ n ~ w ( x ) ; where 
w(x) is the number of ones in x. 

The 7-noisy .ff-test is a (t + |£?|)-function long code test defined as follows: 

7-noisy-H-Test ({g a } a e[t]uE) 

choose x,x\, . . . ,xt uniformly at random in {0, 1}™ 
for every j £ [t], sample rf from /i 7 
for every e 6 E, sample rf from // 7 
accept if and only if 

Ve E E. rL ee 9 (€ > {jf + a*) = g^ {r,^ + £ j£e x t ) 



Remark 2 In the definition of K -function long-code test, we index functions by integers 1, . . . , K, 
while in the definition of the Hypergraph Test we let the given functions be indexed by elements of 
[t] U E. We hope the reader is not too bothered by this abuse of notation. 

8.2 Analysis of the Hypergraph Test 

In this section we prove the following theorem. 

Theorem 27 For every hypergraph H = {\t],E), and every 7 > 0, the 7 -noisy H-test is a (t+\E\)- 
function long code test with completeness 1 — (t + 1)7|-E| and soundness 1/2' E L 
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The completeness part is clear. 

For the soundness part, as in Section 7.2, we can write the probability that the test accepts a given 
set of oracle functions as 

E'CE x ^-^w,v e y eeEiee \ iee j 

=w\ + ^ m e ,. .,nn» w W-^+e ' 



,7 ; 



and so if thest accepts with probability at least 2 1^1 + e there is a subset of tests E f C E such that 

e n n^u^+^o -s (e) K e) +i> ) > e 

*l,-,**.{^,'7 e } ee£ / i6e V iGe / 

It remains to prove the following lemma (E in the lemma plays the role of E' above). 
Lemma 28 Let gj : {0, l} n — > { — 1, 1} be functions and H = ([t], £?) 6e a hypergraph such that 



E.I | <? W fa* + *i) • 3 W I r/ (eJ + >^i| >e (30) 

Xl,...,xtj{»?*!»7 e } 

where the rj's are sampled according to T/ien i/tere is a variable i that has degree-d[e , 7) cross- 
influence at least 5(e, 7) > /or i/ie functions {g^ a '} a e[t]uE- 

Proof: Since the g( a ) map to {—1, 1}, we can write Equation 30 equivalently as 



- ^A3S^ + ^\¥/\^)) >s 

Where Odd denotes the set of vertices of odd degree in the hypergraph ([t], E). 
Now, define G{x) = Er? g(v + s), where rj is sampled from /i 7 . 
Then (30) becomes 

,iin« B 4(n^(E'.))> £ <»> 

' ' VeOdd / \ee-E Vise // 

Let A; be the maximum size of an hyperedge in E, and assume, without loss of generality that H 
has an hyperedge e = (1, 2, 3, ... , k). Fix the variables x^+i, ■ ■ ■ , x% in (32) in such a way that the 
average over x±, x% . . . , Xk is still at least e. In particular, (32) becomes 

E h(0)f{i}(xi)f{2}(x2)---G^(x 1 + x 2 + ... + x k )>e (33) 



Xl,Xl,...,Xk 
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Where f$(x) is the constant function equal product of all the terms in (32) that depend exclusively 
on the fixed variables x^ + i^, . . . , X(t), /{i} (xi) is the product of all the terms that depend only on 
x\ and on . . . ,Xt, and so on. 

Note that each fg is a product of shifts of functions G^ a \ Furthermore, the index sets of a's for 
distinct fg are disjoint. 

If we call /[fc] := G^ e \ then Equation 33 says that 

Choose 5 = 5(k, e) = £ 4 / 2 ° (fc) to be small enough so that Lemma 16 implies that there are two 
functions fg, fr, with S ^ T, and a variable i, such that Inii(fg) and Inf^/r) are both at least 5. 

By previous observations on the relation between the fs and the G^ and by Lemma 4, we have 
that there is a 5' and two functions G a , G b such that Infj(G a ) and Infj(G fe ) are both at least 8' . 

Consider now the Fourier transform of a function G{x) := E ?) 5'(?/ + x). It is easy to see that the 
Fourier coefficients of G satisfy G(a) = (1 — 2^f)^g(a). Therefore, for every degree bound d, we 
have 

Infi(G) = G\a) = £ (1 - 2 7 )N$ 2 (a) < (1 - 2 7 ) d + £ g\a) < (1 - 2 7 ) d + Inf^( 5 ) 
This means that we can get Inf^ rf (g^) , Inf^ rf (g^) > 6' /2 if we choose d = O (7" 1 log(5'~ 1 )) . □ 
8.3 Composition and PCP Construction 

The following theorem follows from Theorem 2 and Theorem 27 using standard techniques. 

We need a couple of definitions. If / : {0,1}™ — > { — 1,1} is a boolean function, then we define 
its folding as the boolean function / defined as follows: /(0, X2, ■ ■ ■ ,x n ) := f(0,X2,---,x n ) and 
/(l, x 2 , . . . , x n ) := -/(0, 1 - x 2 , ■ ■ ■ , 1 - x n ). 

The definition satisfies the following useful properties: (i) if / is a long code, then / = /, and, (ii) 
for every /, E x J(x) = 0, that is, 7(0) = 0. 

Let / : {0, l} n — ► R be a function, and ir : [n] —> [n] be a permutation in S n . Then we let 
/ o 7r : {0, 1}™ — ► R be the function such that / o ir(xi, . . . ,x n ) := /(a^-m, ■ ■ ■ ,x w f n \). Here the 
interesting properties are that, for every coordinate i and degree bound t, 

Inff ( ' i) (/) = Inff(/ovr) 

and that if / is the long code of i, g is the long code of j, and ir(i) = vr'(j'), then / o tt and g o it' 
are the same function. 

Theorem 29 (Main) Suppose that the Unique Games Conjecture is true. Then, for every 5 > 0, 
and for every q > 3, 

NP = PCP 1 _ W29+(5 [0(logn),< / ] 
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and, if we can write q = 2 k — 1, then 



NP = PCP 1 _a )( , +1)/2 , + a[0(logn),«] 

Proof: Let H = ([t],E) be an hypergraph such that t + \E\ = q and t < 1 + log 2 q. (Or 
t = log 2 (<7 + 1) if it is possible to write q = 2* — 1.) 

Fix a small constant <5, and consider the <5-noisy H-test. From Theorem 27 we have that there are 
constants t(5) and d(5) such that if {g a } a &\t]\jE are functions accepted by the test with probability 
at least 1/2' E\ then the degree-d(5) cross-influence of the functions is at least t(5). 
Let 7 be a constant smaller than 5t 2 /d 2 . 

From Theorem 2 we know that, if the Unique Games conjecture is true, there is a reduction from 
SAT to (/-ary unique games with the property that a satisfiable instance if of SAT is mapped into 
a unique game U v of strong value at least 1 — 7 and an unsatisfiable instance if of SAT is mapped 
into a unique game Uu, of weak value at most 7. 

A PCP for a formula (p is a long code for each of the variables of the unique game Uu,. Let n be 
size of the alphabet of U v . 

We consider the following verifier: 

• Pick at random a constraint of Up, say that it involves the variables vi,...,v q and the 
permutations 7Ti, . . . , Tr q . Let f%, . . . , f q be the functions written in the proof being checked 
and which, supposedly, are the long codes of assignments for the variables v 1, . . . , v q . 

• Run the 5-noisy H-test using the functions fi o 7ri, . . . , f q o ir q . 

Completeness of the test If <p is satisfiable, then consider the proof where for each variable v 
we write the function / that is the long code of A(v), with A() being an assignment that strongly 
satisfies a 1 — 7 fraction of constraints. Then the test accepts with probability at least (1 — 7) • (1 — 
q6) > 1 - (q + 1)6. 

Soundness of the test Suppose that the test accepts with probability at least 1/21^1 + 25. 
Consider the following randomized assignment: for every variable v , consider the function / written 
in the proof corresponding to v; consider the set of coordinates i such that Inf^ d (/) > r; if the set 
is empty, give v a random value, otherwise, give v a randomly chosen value from the set. Note that 
if the set is non-empty then it has size at most d/r. 

Call a constraint good if the if -test accepts with probability at least 1/2^1 +5 when that constraint 
is chosen by test. Then at least a 5 fraction of constraints is good. 

Consider a good constraint, and say that it involves the variables v±, . . . ,v q and the permutations 
7Ti, . . . , ir q . Let fi,...,f q be the functions written in the proof corresponding to the variables 

Vl, ...,V q . 

Since the constraint is good, there is a variable i that has degree-d cross-influence at least r for 
the functions fj o ttj. That is, there are two functions f a ,fb such that Inf^ rf (/ a o 7r a ) > r and 
lntf d (f b o 7Tb) > r. Then the randomized assignment described above assigns with probability at 
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least t jd the value 7r a to v a , and with probability at least r/d the value 7r fe (i) to When 
this happens, the constraint is weakly satisfied. 

Overall, the randomized assignments weakly satisfies on average at least a 5r 2 /d 2 > 7 fraction of 
constraints, which proves that ip was satisfiable. 

Conclusion Wrapping up, our PCP verifier has completeness at least 1 — q6 and soundness at 
most 1/2^| + 25. Since 5 was chosen arbitrarily at the beginning, the theorem follows. □ 

8.4 Inapproximability Results 

It is an immediate consequence of Theorem 29 that, assuming the unique games conjecture, Max 
fcCSP cannot be approximated within a factor larger than (k + l)/2 fc if k is of the form 2* — 1. It 
follows from Theorem 29 and the reductions in [TreOl] that, assuming the unique games conjecture, 
the Maximum Independent Set problem in graphs of maximum degree D cannot be approximated 
within a factor larger (log D) c /D, for sufficiently large D, where c is an absolute constant. 
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